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Abstract 

For a graph G, let be the number of spanning rooted forests in which vertex 
j belongs to a tree rooted at i. In this paper, we show that for a path, the /jj's can 
be expressed as the products of Fibonacci numbers; for a cycle, they are products 
of Fibonacci and Lucas numbers. The doubly stochastic graph matrix is the matrix 
F = 3 j x " , where / is the total number of spanning rooted forests of G and n is the 
number of vertices in G. F provides a proximity measure for graph vertices. By the 
matrix forest theorem, i* 1 " 1 = I + L, where L is the Laplacian matrix of G. We show 
that for the paths and the so-called T-caterpillars, some diagonal entries of F (which 
provide a measure of the self-connectivity of vertices) converge to (p^ 1 or to 1 — 
where <j> is the golden ratio, as the number of vertices goes to infinity Thereby, in the 
asymptotic, the corresponding vertices can be metaphorically considered as "golden 
introverts" and "golden extroverts," respectively. This metaphor is reinforced by a 
Markov chain interpretation of the doubly stochastic graph matrix, according to which 
F equals the overall transition matrix of a random walk with a random number of 
steps on G. 

Keywords: Doubly stochastic graph matrix; Matrix forest theorem; Fibonacci num- 
bers; Laplacian matrix; Vertex-vertex proximity; Spanning forest; Golden ratio 
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1 Introduction 

Let G = (V,E) be a simple graph with vertex set V = V(G), \V\ = n, and edge set 
E = E(G). Suppose that n > 2. 

A spanning rooted forest of G is any spanning acyclic subgraph of G with a single vertex 
(a root) marked in each tree. 
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Let fij = fij{G) be the number of spanning rooted forests of G in which vertices i and 
j belong to the same tree rooted at i. The matrix (fij) nxn is the matrix of spanning rooted 
forests of G. Let / = f{G) be the total number of spanning rooted forests of G. 

The matrix F = ^' J j" xn is referred to as the doubly stochastic graph matrix [14, 15, 24, 23] 
or the matrix of relative connectivity via forests. By the matrix forest theorem [7, 8, 5], 

F~ 1 = I + L (1) 

and 

/ = det(/ + L), (2) 

where L is the Laplacian matrix of G, i.e. L = D — A, A being the adjacency matrix of G 
and D the diagonal matrix of vertex degrees of G. Most likely, the matrix (J + L) -1 = F was 
first considered in [11]. Chaiken [4] used the matrix adj(/ + L) = (fij) nX n fo r coordinatizing 
linking systems of strict gammoids. The entry of F can be considered as a measure 
of proximity between vertices % and j in G; the entry measures the self-connectivity of 
vertex i. 

A path is a connected graph in which two vertices have degree 1 and the remaining 
vertices have degree 2. Let P n be the path with V(P n ) = {1,2, ... ,n} and E(P n ) = 
{(l,2),(2,3),...,(n-l,n)}, see Fig. 1(a). 




Figure 1: (a) the path P n ; (b) the T-caterpillar T n ; (c) the cycle C n . 

All spanning rooted forests of P4 and the spanning rooted forests in which vertex 1 
belongs to a tree rooted at vertex 2 are shown in Fig. 2, where thick dots denote roots. 
The matrix F for P4 is 

13 5 2 1 " 
5 10 4 2 
2 4 10 5 ' 
1 2 5 13 

Let T n be the graph obtained from P n by replacing the edge (1,2) with (1,3): V{T n ) = 
{1,2,..., ra} and E{T n ) = {(1,3), (2,3), (3,4), . . (ra - l,n)}, see Fig. 1(b). We call T n a 
T-caterpillar. 
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Figure 2: The spanning rooted forests in P 4 and the forests where 1 is in a tree rooted at 2. 



Let C n , n > 3, be the cyc/e on n vertices: V(C n ) = {l,2,...,n} and E{C n ) = 
{(l,2),(2,3),...,(ra-l,ra),(n,l)},Fig. 1(c). 

By (3>i)i=o,i,2,... = (0, 1, 1, 2, 3, 5, . . .) we denote the Fibonacci numbers. Sometimes, it is 
convenient to consider the subsequences of Fibonacci numbers with odd and even subscripts 
separately: 

&i = $ 2i -i, ^ = 1, 2, ... ; 
< = $ 2l , z = 0,l,2,.... 

In Section 2 we study the spanning rooted forests in paths, cycles, and T-caterpillars, in 
Section 3 the results are interpreted in terms of vertex-vertex proximity, and Sections 4 and 5 
present interpretations of the doubly stochastic graph matrix in terms of random walks and 
information dissemination, respectively. 

2 Spanning rooted forests in paths, cycles, and 
T-caterpillars 

Theorem 1 Let G be a path, G — P n . Then f = and fa = $' min( , tj) • K+i- max (i,j) f or al1 
z, j = 1, . . . ,n. 

The number f(G) of spanning rooted forests in any graph G is equal to the number of 
spanning trees in the graph G +1 , which is G augmented by a "hub" vertex adjacent to every 
vertex of G. Indeed, a bijection between the spanning rooted forests of G and spanning 
trees of G +l is established by connecting every root of every spanning rooted forest to the 
"hub" by an edge. If G = P n , then G +1 is a fan graph sometimes also called a "terminated 
ladder." The fact that the number of spanning trees in this G +1 equals $^ is familiar to 
electrical network theorists ([18], cf. [16, 17, 1]). Among others, it was obtained by Hilton [12]. 
Myers [20] proved this using the notion of weighted composition; in [3] this fact was derived 
using Chebyshev polynomials. Our aim is to give a direct combinatorial proof of Theorem 1 
in order to fully clarify the recurrence structure of spanning rooted forests in a path. Here, a 
proof of f(P n ) = &n is integrated with a proof of the expression for given in Theorem I. 

For any graph G, T{G) will denote the set of spanning rooted forests of G. 
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Proof of Theorem 1. Let jF m = JF(-P m ), let f(m) = \j 7m \, m = 1,2, Then for every 

k > 1, 

f(k + l) = \J%%\ + \f™\, (3) 

where J^ 2) = {F G jF m | (1,2) G £(F)} and ^ = \ J^ 2) . 
Let jf 1 = F,(P m ) = {F G JF™ | 1 is a root in F}. Then in (3) 

i^Si = i^i and i^i)i = i^ +1 i- (4) 

Indeed, a bijection between JF^^ and jF fc can be established by the restriction of each 
F G ^° ^ ne ver tex subset {2, . . . , &;}; a bijection between JF^ + 2 ^ and JF^ +1 is established 

as follows: for every F G J 7 ^^) obt^ 11 F' by putting F' = F if 1 is a root in F and by 
removing edge (1, 2) and marking vertex 1 as a root, otherwise. Then F' G JF^ +1 and this 
correspondence is one-to-one. 

By (3) and (4), 

f(k + l) = f(k) + f(k + l) k = l,2,..., (5) 
where /*(m) = \ Let ^ 12) = TT H ^ >2) , m = 1, 2, . . . . Using (4) we obtain 

/•(* + 1) = + 1^1,2)1 = W + 1^1 = /(*) + * = 1, 2, ••• • (6) 

Here, a bijection between J 7 ^^) and ■^ r * ^ s established by coalescing vertex 2 with the 
root 1 and collapsing edge (1, 2). 

Observe now that /(l) = 1 = and /*(1) = 1 = $' r By (5) and (6), f(k) and /*(Jfe) 
satisfy the same recurrence relations as and $' fc do, respectively. Therefore 

/(*)=** and /•(*) = **, A; = 1,2,.... (7) 

Thus, / = f{n) = To find fa, i,j = 1, . . . ,n, observe that fa counts the spanning 
rooted forests that contain the i—j path rooted at %. To obtain a spanning rooted forest, this 
path can be completed on the subset of vertices {1, . . . , min(i, j)} in f*(mm(i,j)) ways and 
on the subset of vertices {max(i, j), . . . , n} in f*(n + 1 — max(z, j)) ways. Since the ways of 
these types are all compatible, (7) implies that fa = $ / min(ij) • ®' n +i-max(i,jy n 

Theorem 1 as well as Theorem 2 below can also be proved algebraically by means of the 
matrix forest theorem (Eqs. (1) and (2)). We present combinatorial proofs here, since they 
are a bit more illuminating. However, Theorem 3 below is proved algebraically. 

Theorem 2 Let G be a cycle, G = C n with n > 3. Then f = $' n + <&' n+1 - 2 and 
fa = %. tl + K-\j-^ hj = h---,n. 

For G = C n , the augmented graph G +l mentioned above is the wheel on n + 1 vertices. 
The fact that the number of spanning trees in the wheel is $' n + — 2 is due to Sedlacek 
[21] and Myers [19]. Myers [20] proved this using identities involving weighted compositions; 
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the proof by Benjamin and Yerger [2] is based on counting imperfect matchings. A useful 
tool for solving such problems is Chebyshev polynomials, see [17, 3, 25]. Our proof of the 
identity f(C n ) = $ / n + $' t+1 — 2 presented here for completeness is based on relations between 
forests found before. The proof of Theorem 2 relies on the following lemma. 

Lemma 1 For n > 2, let JF™„ = {F G .F(Pn) I 1 an d n are roots in F}. Then |jF™J = 

Proof. A bijection between JF™„ and the set ,F(P n _i) of spanning rooted forests in P n _i can 
be established as follows. For every F G JF(-P„_i) define F' as the spanning subgraph of P n 
whose roots satisfy two conditions: 

(1) vertex % is a root in F' iff [i — 1 or % — n or (i — 1, i) G" E(F)}; 

(2) (z, i + 1) G P(F') iff i is not a root in F. 

In this case, F' is a spanning rooted forest of P n . Indeed, if one assumes that some tree 
in F' has no root or has more than one root, then this would imply the presence of a tree 
with more than one root or no root in F, respectively. Furthermore, every F' G JF™* has a 
pre-image in .F(Pn-i), and this correspondence is by definition one-to-one. 

Consequently, by (7), = \F(P n -i)\ = K-v □ 

Proof of Theorem 2. Let Tij be the set of spanning rooted forests of C m in which j 
belongs to a tree rooted at i. Observe that 

fij = \^t~,j\ + l-T^jl > ( 8 ) 

where J*^ = {F G JF£ | (n, 1) £ E(F)} and J?^ = {F G JF£ | (n, 1) G E(F)}. 

We now show that I^Ljl = ^u-i\- Every forest in JF™^ contains the path P n+ i_|j_j| 
formed by the vertices in the sequence (max(z, j), . . . , n, 1, . . . , min(z, j)) and the edges be- 
tween the neighboring elements in this sequence. The ways of completing this path to obtain 
a spanning rooted forest in C n can be put into a one-to-one correspondence with the elements 
of the set jf]* defined in Lemma 1. Indeed, linking each F G Jf]* - *' -1-1 with P n+ i_y_i\ by 
replacing the vertices 1 and \j — i\ + 1 of F with vertices i and j of P n +i~\j~i\, respectively, 
produces a spanning rooted forest of C n , and every spanning rooted forest of C n can be 
uniquely obtained in this manner. Thus by Lemma 1, |jF^.| = \J& = Similarly, 

= l^ 1 " 1 "'""! = K-\j-i\- Therefore by (8) it follows that f tj = + 
Let T m = j(C m ), m > 3. Then 

f(C n ) = 1^(1^)1 + |jF(i,„)*| + |.F*(i )n )|, (9) 
{F G JF" 1 | (1, n) E(F)}, 

{F G T m | (L,n) G P(F) and the path joining 1 with the root contains n}, 
{F G jF m | (1, n) G E(F) and the path joining n with the root contains 1}. 

Obviously, (jF^I = |jF(P n )|, so, by Theorem 1, (jF^I = Consider any F G .F(i,n)*- 
Removing (l,n) from E(F) and marking 1 as a root produces a forest F' G ,F*(P n ), where 



where 

^(l,n)* = 
.F*(l,n) = 
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J-~*{Pn) = {F G jF(P n ) | 1 is a root in F} was defined in the proof of Theorem 1. All elements 
of ^(Pn) can be obtained in this way, except for the whole path P n rooted at 1. That is 
why |jF ( i,n)*| = \F*{P n )\ - 1 and, by (7), |jF(i >n) *| = & n - 1. Similarly, \K(i, n )\ = & n - 1. 
Substituting this in (9) provides 

f(C n ) = < + 2« -!) = < + < +1 - 2. □ 



Now recall that 



where $_i = 1, are the Lucas numbers: (Aj^^i^,... = (2,1,3,4,7,11,18,29,47,...), see, 
e.g., [13]. The Lucas numbers satisfy the same recurrence as the Fibonacci numbers do: 

A + A m = A +2 , i = 0, 1, 2, ... , 

but some other properties of the Lucas numbers are even more elegant than those of the 
Fibonacci numbers. 

By Theorem 2, f(C n ) = A2 n — 2. The numbers of forests in a cycle can also be expressed 
via smaller Fibonacci and Lucas numbers, viz. Corollary 1 holds. 1 

Corollary 1 Let G be a cycle, G = C n with n > 3. Then 

jA 2 n , n = 2k-l J$ t A„, n = 2fc-l .._ 

; \5<e n = 2k ' hi \k t <S> n , n = 2k ' h3 '-"^ 

where t = \n — 2 \j — i\ | . 

Corollary 1 is derived from Theorem 2 by means of classical identities involving Fibonacci 
and Lucas numbers. It provides a simple expression for the entries of the doubly stochastic 



graph matrix F = -^-p- of C n : 



(M 

(/«) 



Corollary 2 The entries of the doubly stochastic matrix F = ^j-^- of C n (n > 3) are: 

fij f^/An, n = 2k-l . 

— r = < , i, 7 = 1, . . . , n, 10 

/ \V5$ n , n = 2k V J 

where t = \n — 2 \j — i\ . 

In the expression (10), for every row of F, the numerators make up a segment of a fixed 
symmetric two-sided sequence: for odd n this sequence is (. . . , 34, 13, 5, 2, 1, 1, 2, 5, 13, 34, . . .); 
for even n it is (. . . , 47, 18, 7, 3, 2, 3, 7, 18, 47, . . .). Thereby the ratio of two corresponding 
elements of F is the same for all n of the same parity. 

Regarding the T-caterpillars, we are mainly interested in the total number / of spanning 
rooted forests and the diagonal entries ^33 and f nn of the matrix of spanning rooted forests. 



A knot theory interpretation of the squareness of — 2 when n is odd can be found in [22] . 
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Theorem 3 LetG be a T- caterpillar, G = T n . Then f ' = 4$' n „ 1 , /33 = 4$^_ 2 , and f n 



Proof. Observe that $' ' = 0, 4>" = 1, and for i — 1, 2, 



$2 i+ 2 



= $2i + $2i+l = $2i + $2i + $2i-l 
$2i-$2t-2 = 3<-d. 



Similarly, = 1 and for i — 1, 2, 



3$: - 



i-l) 



i+1 
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z > 1 

z = 1 
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Equations F(I + L) = I (see (1)) and (13) imply 

2/nl — fn3 
2/n.2 ~ fn3 
— fnl — fn2 + 4/„3 — f n 4 
~fn3 + 3/ n 4 ~ fn5 



0. 
0. 
0. 
0. 
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(11) 



(12) 



(13) 



(14) 



fn,n—2 3f nn —\ fnn 0j 
fn,n—l ^fnn f ■ 

Note that f n i = f n 2 = 2 = 2$", since T n has exactly two spanning rooted forests where 
vertex 1 belongs to a tree rooted at n, one of them being T n with root n, the other T n with 
edge (2,3) deleted and roots n and 2. Then, by (14), f n3 = 2/ nl = 4 = 4$". Consequently, 
using (14), (11) and induction, we obtain that 

fnA = 4/n3 - fnl ~ fnl = 12 = 4(3$? - 
fnS = 3/ n4 - fn3 = 4(3^' - $?) = 4^, 



fnn = 3/ re , n _! - / n ,„_ 2 = 4(3$^_ 3 - <_J 



44>' 2 ', 



(15) 
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From the last equation of (14), / = 2f nn - f n>n ^ = 4(2$£_ 2 - $^_ 3 ) = 4(<^_ 2 + $' n _ 2 ) = 
4$' n _ 1 . It remains to show that / 33 = 4$^_ 2 . Eqs. (1) and (13) imply 

— ^33 + 3/34 — /35 = 0, 
— /34 + 3/35 — fz& = 0, 
(16) 

—h,n-2 + 3/3 >n _i — fe n = 0, 
— fs,n-l + 2/3„ = 0. 

Since / 3n = / n3 = 4 = 4 < 3>' 1 , from (16) and (12) we have that / 3 „_i = 4$ 2 and, by induction, 

/33 = 4<_ 2 . □ 



3 "Golden introverts" and "golden extroverts" 

In [7, 8, 9] (see also [14, 15, 6]) F = ^' J |" x " was studied as a matrix of vertex- vertex proximity. 

f ■ 

For every graph G, F is a positive definite doubly stochastic matrix and -j- measures the 
relative strength of connections between vertices i and j in G. This proximity measure was 
referred to as the relative connectivity via forests. For some additional applications of F we 
refer to [11, 10]. 

It turns out [15, Theorem 2] that for every pair of vertices i and j such that j ^ i, 
fij < /ii/2; *y can be considered as a measure of self-connectivity of vertex i. By [14, 
Corollary 7], y- > (1 + rfi) -1 , where dj is the degree of vertex i. 

A vertex i can be called "an introvert" if fy > 0.5 (or, equivalently, /„ > X^i/y) 

and "an extrovert" if y- < 0.5 (equivalently, fa < fij)- The complete graph on three 

vertices provides an example of the boundary case where fy = 0.5 and fa = ^2j^i fij for 
every vertex i. 

Proposition 1 Let be the golden ratio, = 2 +1 - Then 

(i) For the paths P n , lim fy- = _1 ; 

n— >oo J 

(ii) For t/ie T- caterpillars T n , lim ^ = <p~ l and lim ^ = 1 — 

Proof. By Theorem 1 and Binet's Fibonacci number formula, for the paths P n , 
lim (fu/f) = lim (&JK) = B Y Theorem 3, for the T-caterpillars T n , lim (f nn /f) = 

71 — ► OO 71 — ► OO 71 — ^OO 

lim (4^_ 2 /4$' n _ 1 ) = 0- 1 and lim (/ 33 //) = lim (4$^_ 2 /4$' n _ 1 ) = <f>~ 2 = 1 - <j>-\ □ 

n— >oo n— >oo n— >oo 

Corollary 3 Let </> 6e the golden ratio, <p = v/ ^ +1 . TTien 
(z) For the paths F n , lim = 0; 

n— >oo Jit 



(ii) For t/ie T-caterpillars T n , lim = and lim 



Corollary 3 follows from Proposition 1 and the fact that F is stochastic. 

It can be shown that (ii) of Proposition 1 and (ii) of Corollary 3 remain true for the 
graphs resulting from T-caterpillars by the addition of edge (1, 2). 

In accordance with Corollary 3, as n — > oo, vertices 1 and n in a path and vertex n in a 
T-caterpillar tend to be "golden introverts" (named after the golden ratio), whereas vertex 3 
in a T-caterpillar tends to become a "golden extrovert." This provides a kind of sociological 
interpretation of Corollary 3. 

4 A random walk interpretation of the doubly 
stochastic graph matrix 

To better comprehend what exactly the results of the previous section mean, consider a 
random walk interpretation of the doubly stochastic graph matrix. 

For a graph G, consider any Markov chain whose states are the vertices of G, {1,2, ... ,n}, 
and the probabilities of all i— > j transitions with % ^ j are proportional 2 to the corresponding 
elements of the adjacency matrix of G: 

p ij = ea ij , z,j = l,...,n, i^j. (17) 

Then the diagonal elements of the transition matrix P = (pij) are determined as follows: 

p u = 1 - y^eajj, z = l, ...,n (18) 

and, in a matrix form, 

P = I-eL(G), 

where L(G) is the Laplacian matrix of G. 

The maximum value of e that guarantees correctness, i.e., the nonnegativity of the di- 
agonal entries (18) for all simple graphs on n vertices, is obviously e = (n — 1) . On the 
other hand, e = (n— l)" 1 is the only correct e that allows the self-transition probabilities pa 
to be zero. Therefore it makes sense to consider this value of e and the Markov chain with 
transition matrix 

P = j _ (n - 1)" 1 L(G) (19) 

in more detail. 

For this chain, let us examine random walks with a random number of steps. Namely, 
consider a sequence of independent Bernoulli trials indexed by 0, 1, 2, . . . with a certain 
success probability q. Suppose that the number of steps in a random walk equals the trial 

2 There are two popular methods of attaching a Markov chain to a graph. The first one is based on (17); 
for any undirected graph it provides a symmetric transition matrix with, in general, nonzero diagonal. 
The second method assumes that pij = aij/J2k=i a ik- F° r an undirected graph without loops it generally 
provides a nonsymmctric transition matrix with zero diagonal. 
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number of the first success. Then the number of steps, K, is a geometrically distributed 
random variable: 

Pr {K = k} = q(l — q) k , k = 0, 1, 2, . . . . 

Suppose that q = 1/n. For this value, the expected number of steps is n — 1, which is 
the number of edges in every spanning tree of G. Then 

If l x ' 



Pr{K = k} = -[l--) , k = 0,1,2,.... (20) 
n \ n J 

Let Q = (qij) be the matrix with entries 

q i:j = Pr{X K = j | X = i}, i,j = l,...,n, (21) 

where is the state of the Markov chain under consideration at step k, i.e., Q is the 
transition matrix of the overall random walk with a random number of steps K. 

Theorem 4 For a graph G onn vertices and the corresponding Markov chain whose transi- 
tion matrix is (19), let Q be the transition matrix (21) of the overall random walk whose num- 
her of steps is geometrically distributed with parameter 1/n. Then Q = F, where F = J j" x " 
is the doubly stochastic matrix of G. 

Proof. Since the spectral radius of P is 1, for every q such that < s < 1 

oo 

J^^^il-sP)- 1 

k=0 

holds. Consequently, using the formula of total probability, (20), (19) and the matrix forest 
theorem (1) we obtain 



Q 



k 

k 



OO OO ^ s , \ & 

£p r{ tf=*>p* = £i(l-i) p 

k=0 k=0 v 7 

f l--)p] 1 = (I + L)~' = F. n 



n \ \ n 



By virtue of Theorem 4, if a "golden extrovert" walks randomly in accordance with the 
above model, she eventually finds herself on a visit times more often than at home, whereas 
for a "golden introvert" the situation is opposite. 
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5 A concluding note: a communicative interpretation 
of the doubly stochastic graph matrix 



In closing, let us mention an interpretation of the doubly stochastic graph matrix in terms 
of information dissemination. Suppose that a sequence of information units (or ideas) are 
transmitted through a graph G. A plan of information transmission is a rooted forest 
F G JF(G): every information unit (idea) is initially injected into the roots of F; after that 
it comes to the other vertices along the edges of F. Suppose that every time a possible plan 
is chosen at random: the probability of every choice is \F(G)\ = f. Then idufsst i s the 
probability that an information unit arrives at j from root i. As a result, for a "golden 
introvert" the expected proportion of "her own" (injected straight into her mind) ideas to 
adopted ideas is <fi, whereas for a "golden extrovert" the proportion is inverse. 
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